Recent advances in Apertium, a free/open-source rule-based machine translation platform for low-resource languages
نویسندگان
چکیده
Abstract This paper presents an overview of Apertium, a free and open-source rule-based machine translation platform. Translation in Apertium happens through pipeline modular tools, the platform continues to be improved as more language pairs are added. Several advances have been implemented since last publication, including some new optional modules: module that allows rules process recursive structures at structural transfer stage, deals with contiguous discontiguous multi-word expressions, resolves anaphora aid translation. Also highlighted is hybridisation statistical modules augment pipeline, methods existing modules. includes morphological disambiguation, weighted transfer, lexical selection learn from limited data. The also discusses how like can critical part access technology for so-called low-resource languages, which might ignored or deemed unapproachable by popular corpus-based technologies. Finally, released unreleased pairs, concluding brief look supplementary tools prove valuable users well developers. All Apertium-related code, data, free/open-source available https://github.com/apertium .
منابع مشابه
Apertium goes SOA: an efficient and scalable service based on the Apertium rule-based machine translation platform
Service Oriented Architecture (SOA) is a paradigm for organising and using distributed services that may be under the control of different ownership domains and implemented using various technology stacks. In some contexts, an organisation using an IT infrastructure implementing the SOA paradigm can take a great benefit from the integration, in its business processes, of efficient machine trans...
متن کاملNeural machine translation for low-resource languages
Neural machine translation (NMT) approaches have improved the state of the art in many machine translation settings over the last couple of years, but they require large amounts of training data to produce sensible output. We demonstrate that NMT can be used for low-resource languages as well, by introducing more local dependencies and using word alignments to learn sentence reordering during t...
متن کاملTradubi: Open-Source Social Translation for the Apertium Machine Translation Platform
Massive online collaboration could become a winning strategy to tear down the language barriers on the web, and in order for this to happen appropriate computer tools, like reliable machine translation systems and friendly postediting interfaces, should be widely available. However, community collaboration should not only involve the postediting of machine translations, but also the creation of...
متن کاملSharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium
In this paper, we describe two methods developed for sharing linguistic data between two free and open source rule based machine translation systems: Apertium, a shallow-transfer system; and Grammatical Framework (GF), which performs a deeper syntactic transfer. In the first method, we describe the conversion of lexical data from Apertium to GF, while in the second one we automatically extract ...
متن کاملFree/Open-Source Resources in the Apertium Platform for Machine Translation Research and Development
This paper describes the resources available in the Apertium platform, a free/open-source framework for creating rule-basedmachine translation systems. Resources within the platform take the form of finite-state morphologies for morphological analysis and generation, bilingual transfer lexica, probabilistic part-of-speech taggers and transfer rule files, all in standardised formats. These resou...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Machine Translation
سال: 2021
ISSN: ['0922-6567', '1573-0573']
DOI: https://doi.org/10.1007/s10590-021-09260-6